Intramolecular Fluorescence Correlation Spectroscopy in a Feedback Tracking 

Microscope 
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We derive the statistics of tlie signals generated by shape fluctuations of large molecules studied 
by feedback tracking microscopy. We account for the influence of intramolecular dynamics on the 
response of the tracking system, and derive a general expression for the fluorescence autocorrelation 
function that applies when those dynamics are linear. We show that tracking provides enhanced 
sensitivity to translational diffusion, molecular size, heterogeneity and long time-scale decays in com- 
parison to traditional fluorescence correlation spectroscopy. We demonstrate our approach by using 
a three-dimensional tracking microscope to study genomic A-phage DNA molecules with various 
fluorescence label conflgurations. 



I. INTRODUCTION 

Fluorescence Correlation Spectroscopy (FCS) is a 
method for optically measuring local concentration fluc- 
tuations of fluorescence-labeled molecules in solution[l- 
4]. The fluorescence autocorrelation function — referred 
to as the FCS curve — contains signatures of the dy- 
namic properties of those molecules, such as diffusion co- 
efflcients or reaction rates, which are inferred by com- 
parison of the curve to theoretical predictions. Mod- 
ern approaches to FCS use tightly focused laser beams 
and confocal detection to probe diffraction-limited sam- 
ple volumes, are sensitive enough to measure fluorescence 
from single dye-labeled molecules, and can resolve fluctu- 
ations on time-scales as fast as the fluorescence lifetimes 
of the dyes[5, 6]. These methods have found success in a 
wide range of applications in biology and chemistry. 

One relatively new application of FCS is the study 
of the intramolecular dynamics of large polymer chains. 
Such motions were first described theoretically over 50 
years ago [7, 8], but were only coarsely probed experi- 
mentally because of the insensitivity of the experimental 
methods available at the time. As first demonstrated in 
2003, FCS is sensitive to the internal motions of polymers 
that are large relative to the focused waist of the probe 
laser [9]. These initial measurements suggested that the 
internal dynamics of large double-stranded DNA (ds- 
DNA) molecules are dominated by the stiffness of the 
polymer chain and by hydrodynamic couplings between 
spatially proximate polymer segments. Later measure- 
ments conflicted with these conclusions[10], however, and 
the resulting controversy has yet to be fully resolved de- 
spite several iterations of experimental and theoretical 
improvements [11-13]. 

In a recent paper[14], we argue that FCS is not a suffi- 
ciently sensitive technique for characterizing the internal 
dynamics of large polymer molecules because the the- 
oretical FCS curves contain too many free parameters. 
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resulting in underdetermined numerical fits to the data. 
We instead used a technique based on feedback track- 
ing microscopy, a technology that has been developed 
by our and other groups over the past few years[15-22] 
(and reviewed recently in [23]) by which a feedback sys- 
tem tracks the translational motion of a single molecule 
and keeps it in the focus of the microscope. In the par- 
ticular variant of feedback microscopy developed by our 
group, we compute the autocorrelation function of the 
fluorescence measured from the tracked molecule and an- 
alyze it in a manner analogous to traditional FCS [24-26]. 
This tracking-FCS (tFCS) approach generates statistics 
that are related to traditional FCS (which we will re- 
fer to as stationary FCS for clarity in this paper), but 
provides enhanced sensitivity to both translational and 
intramolecular motion. Our tFCS measurements re- 
vealed that the translational statistics and radius of gy- 
ration of large dsDNA molecules were together consistent 
with strong hydrodynamic interactions, as commonly ex- 
pected, but that the intramolecular relaxation statistics 
are surprisingly not consistent with the Zimm polymer 
model[8, 14, 27]. 

This paper provides the mathematical foundation for 
intramolecular tracking-FCS. In section II we compute 
the tFCS curve for a molecule exhibiting conformation 
fluctuations. We account for the effect of those fiuctua- 
tions on the response of the tracking system, as well as 
the systematic artifacts that the tracking system adds to 
the tFCS curve. We focus in particular on molecules de- 
scribed by linear dynamical models, both because these 
include the standard polymer dynamics theories and be- 
cause it is possible to derive closed-form expressions for 
the tFCS curve for such models. We illustrate character- 
istics of the intramolecular tFCS approach with a sim- 
ple example model for the molecular motion. In Section 
III we demonstrate the application of tFCS to the study 
of the intramolecular motion of fluorescence-labeled ge- 
nomic A-phage DNA using a variation on the appara- 
tus we described in [28]. Our measurements are consis- 
tent with predictions for molecules with three different 
label configurations; furthermore, they reveal sensitivity 
to heterogeneity between molecules with random labeling 
schemes with a noise floor consistent with the predicted 
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FIG. 1: Typical experimental scenario. The gray curve rep- 
resents a molecule sparsely labeled with red and cyan dyes. 
The circular violet region represents the tFCS probe beam 
(the larger tracking beam is not illustrated). Vectors Xcm, 
Xm and y relative to the arbitrary origin O represent the cen- 
ter of mass of the molecule, position of tFCS dye m, and beam 
position, respectively. Dye positions relative to Xcm (r™ for 
tFCS dyes and z,i for tracking dyes) are used in the text for 
convenience. 



photon-counting noise level. 



II. THE INTRAMOLECULAR TRACKING-FCS 
CURVE 

We begin with a description of the experimental sce- 
nario we are concerned with, illustrated in Figure 1. A 
molecule is labeled with a collection of two types of flu- 
orescent dyes, distinguished in the figure by their colors. 
The red dyes are excited by the tracking laser to deter- 
mine the position of the molecule, and the cyan dyes are 
excited by a probe laser for intramolecular tFCS mea- 
surements. The tracking system reacts to the motion of 
the molecule by adjusting the position of the laser beams 
to follow the center of mass of the tracking dyes. Funda- 
mental localization noise induces tracking errors so the 
beams do not follow the target position precisely, as em- 
phasized by their off-center displacement in the figure. 
The tFCS dyes emit fluorescence bursts whenever the 
intramolecular motion causes them to drift through the 
probe laser beam. The goal of this section is to calculate 
the statistics of this tFCS signal as determined by the 
dynamics of the tracking system and the statistics of the 
molecule's motion. 

We begin with the standard definition of the FCS 
curve. 
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where (•) denotes an average over time, equivalent to an 
ensemble average due to the ergodicity of the fluorescence 
signal. In this paper we deal with g2{T) for a collection 
of M dyes moving along a set of trajectories {x*,^}, each 
with its own possibly distinct dynamics (we will use this 
superscript notation as shorthand to indicate the time 
dependence of various quantities throughout this paper) . 
We define the brightness hm of dye m so that hmi'o is the 
fluorescence rate we detect when that dye is excited by a 
laser with intensity We can then write lit) in terms 
of the dye positions and brightnesses and the spatially- 
varying laser intensity 0(x): 
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where y* is the position of the laser beam over time, 
the case where y* is constant, this I{t) is exactly that of 
stationary FCS. 

We express Eq. 2 in terms of the Fourier transform of 
(/)(x), (/)(k), in order to facilitate calculations. The un- 
normalized autocorrelation G(r) = {I{t)I{t + t)) is then 
given by 
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where we have simplified notation by defining the con- 



catenated vectors k 
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and x„ 



and the product of the beam profiles $(k) 



(/)(— k)(/)(— k'). A similar expression exists for the time 
average {I{t)). 

Equations 1 and 3 together define g2{T) in terms of 
the properties of the tracked molecule and tracking sys- 
tem; all that remains is to insert appropriate models for 
these dynamics. In the general case Eq. 3 cannot be 
simplified because x^/x and y are highly correlated. The 
tracking system responds to both the center-of-mass and 
intramolecular motion of the molecule, and these mo- 
tions may be correlated with each other. For example, a 
molecule's shape fluctuations may couple into variations 
in its apparent translational diffusion coefficient [13], and 
spatial inhomogeneities within the sample may cause the 
intramolecular dynamics to depend on the center-of-mass 
position. As a consequence, the only general approach to 
calculating the tFCS curve relies on solving for the joint 
statistics of the laser and the dyes from a set of coupled 
equations of motion. 



A. Linear models in homogeneous samples 

Linear models such as the Rouse and Zimm polymer 
models predict Gaussian statistics for the molecular mo- 
tion because the Brownian force is Gaussian. Our feed- 
back system is linear as well, so it exhibits Gaussian 
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statistics in response to the molecular niotion[26]. If we 
assume that Xm/j a-nd y are jointly Gaussian random vari- 
ables, we can make a fairly dramatic simplification to the 
average in Eq. 3 (see supporting information): 
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where we have adopted the notation for the outer product 
= xx""" and have assiimed that x„,^j — y has zero mean, 
a condition that is enforced by the tracking system. This 
simplification is helpful because it separates the average 
into second-order correlation matrices that are relatively 
easy to calculate. 

Linear models further predict that the center of mass 
motion and intramolecular motion are uncorrelated, pro- 
vided the absence of any spatially inhomogeneous fields. 
These models express the dynamics using an eigenfunc- 
tion expansion in which the center-of-mass motion ap- 
pears as the zeroth-order mode, while the intramolecular 
motion is superposed in the higher-order modes. If we 
write x^ as a sum of the center-of-mass position 
and the dye position relative to the center of mass r^, 
then the autocorrelation function is just the sum of au- 
tocorrelations. For example, 
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Similarly, we can write y* as a sum over components 
that respond to the center-of-mass motion, y'^^^; to the 
intramolecular motion, which we denote y* ; and to local- 
ization noise arising from photon counting as described 
in [29], which we denote yp. These three components are 
also uncorrelated: the first two because of the assumption 
of independence between center-of-mass and intramolec- 
ular motion, and the third because the photon counting 
process is independent of dye position. 

We now insert these decomposed terms into Eq. 4 to 
get 
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where we have defined 
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and in those definitions we have used the ~ notation to 
indicate concatenated two-time vectors as in Eq. 3. De- 
fined in this way, constitutes a tracking error arising 



from finite system response bandwidth and from local- 
ization noise and is identical to that previously described 
in [26] . The intramolecular motion therefore adds a new 
time-dependent variance term to the systematic tracking 
error variance that was characterized previously. 

We now approximate the excitation laser beam with 
a three-dimensional Gaussian, as is standard practice in 
the PCS literature. We denote the covariance of this 
Gaussian by the diagonal matrix jW, where the elements 
of — denoted w^, Wy and wl — are the squares of the 
beam waists along the x-, y-, and z-axes. Combining 
this with Eq. 6, we can compute the integrals in Eq. 3 
to write the FGS curve in terms of the matrices S"^ and 
AL„: 
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where the constant of proportionality can by found by 
direct computation of {I{t)), or more simply by requiring 
that 52(00) = 0. 

Equation 9 is suitable for any intramolecular model 
with linear dynamics. In general there may be coupling 
between the intramolecular motion along different Carte- 
sian axes; for example, dye movement can induce fiuid 
flows in the solvent that act upon adjacent dyes in a man- 
ner that must be described by a mobility tensor. How- 
ever, the models that we will work with in this paper 
do not contain this feature. In this case, we may write 
the matrices A^^ and E"^ in terms of smaller, diagonal 
matrices (denoting the 3x3 identity by Ida): 
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quantities A^^ and cr"^ must then be found from the mod- 
els chosen for the intramolecular and tracking system dy- 
namics. We must note that, in addition to requiring inde- 
pendence, when expressed in this way A^^ and S"^ also 
require that the dynamics are identical along all three 
Cartesian axes. In practice this idealization is rarely 
realized because the xy-axes and z-axis usually do not 
perform identically in our apparatus. We will therefore 
distinguish between the dynamics along diff'erent axes, 
when necessary, using the notation (AJ^^) and crj to 
indicate the x-axis, and likewise for the y- and z-axes. 

We may now write the tFCS curve in the most general 
form we will require in this paper: 
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which reduces determination of (72 (t) to the problem of 
finding A^„^^ and a'^ for the particular models chosen for 
the intramolecular dynamics and the tracking system. In 
the next section, we compute the system response to the 
motion of the tracked molecule in order both to fully 
calculate a'^ and to derive the necessary equations for 
calculation of the yz-dependent terms in A^^. 

B. Tracking system dynamics 

Earlier work by our group described the tracking stage 
as a linear control system that responds continuously to 
differences in position between it and the tracked parti- 
cle, and furthermore characterized the fluorescence fluc- 
tuations that arise due to imperfect tracking fidelity [26]. 
Here we expand upon this work by incorporating the ef- 
fect of intramolecular shape fluctuations on the tracking 
statistics. The approach taken here differs significantly 
from previous work in that we use a Langevin equation, 
rather than a Fokker-Planck equation, to compute the 
statistics we require. The advantage of this approach is 
that it is simpler to incorporate generalized intramolec- 
ular dynamics as a stochastic input to a deterministic 
tracking system than by finding and solving a Fokker- 
Planck equation that describes the joint molecular and 
tracking stage statistics. 

We let the molecule be labeled by a set of N dyes to 
which the tracking system responds; these may be the 
same as those used to compute the FCS curve, but need 
not be. Therefore we must be somewhat careful in dis- 
tinguishing between tracking dyes and FCS dyes. We let 
the tracking dyes move along a set of trajectories that we 
write as {x*jjj + z^}, where arc positions relative to the 
center of mass that fluctuate according to the molecule's 
internal dynamics. As with x*^ and previously, we 
assume that x'j^^ and z*^ are uncorrelated. 

Provided that the molecule is smaller than the rota- 
tion radius and axial modulation distance of the tracking 
laser beam, the fluorescence from any individual dye on it 
will produce a localization signal that is proportional to 
the dye's distance from the tracking fixed point [24, 30]. 
These signals add linearly for multiple dyes, so that the 
tracking system follows the average position of the N 
tracking dyes. 

We let y* be the position of the tracking stage along 
the X-axis, without loss of generality, as a function of 
time. Similarly, x*^^ and 2;^ are the x-axis components of 
their corresponding vectors. We write a linear dynamical 



system to describe the evolution of in response to these 
dye positions and to a localization noise i^* arising from 
photon counting as described in [29]: 

S' = Cq', 

where q* is an internal state variable and the matrices 
A, B, and C arc a state-space realization of the track- 
ing system dynamics. This abstract matrix formalism 
is used because the dynamics of the tracking system de- 
pend on those of all of its constituent components and, 
particularly with mechanical stages like our piezoelectric 
one, some of the system components may have nontriv- 
ial responses within the operational feedback bandwidth. 
While we can sometimes obtain satisfactory results with 
a simple first-order system (in which A, B, and C are 
scalars), often we require a two-dimensional system in 
order to account for the finite bandwidth of the mechan- 
ical stage [26]. 

We now solve Eq. 13 in terms of its inputs, 




(14) 

The first term in this equation is a transient one reflect- 
ing the state of the system at the initiation of tracking; 
we avoid consideration of such transients because the lin- 
earity assumption for the localization signal is often vi- 
olated, and the resulting nonlinear statistics are quite 
complicated [30]. Fortunately, the stability of the track- 
ing system requires A < 0, so that the transients decay 
sharply for t larger than the time constant set by |A]~^. 

We will use Eq. 14 in the steady-state limit |A]t » 1 
to compute the statistics of 7/*. This limit can only be 
interpreted when z*^ is stationary, but this will essentially 
always be the case. We assume that the center of mass 
moves by ordinary Brownian motion. The inputs 2* and 
both have mean zero, so that (y*) = For sim- 

plicity, we define our coordinate system so that = 0. 

The two-time correlation functions are more substan- 
tial. We break y' into its uncorrelated constituent parts, 
as described earlier: y* = + vl + Vf- For each part 
we write out the product, insert the appropriate corre- 
lations, compute the integrals and take the steady-state 
limit. Beginning with the center-of-mass motion, we in- 
sert the Brownian correlation (x'jjjCC*^) = 2£)min{ti, ^2} 
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and find, after some manipulation, 
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where r°° = limt^oo /q ci^e^(*-«)BBTe^^(*-« as de- 
fined in [26]. r°° may alternatively be expressed as the 
solution of the equation Ar°° + r°°A'^ = -BB"^, which 
we exploit to simplify expressions containing this term. 
The simplification to Eq. 15 also requires CA~^B = —1, 
which indicates that the tracking system has no deter- 
ministic steady-state error and is guaranteed by the use 
of an integrating controller in the feedback loop[26]. The 
r-independent term in Eq. 15 represents the lag between 
the molecule and stage positions resulting from finite 
feedback bandwidth. As |A| — >• oo the lag term goes 
to zero because the tracking system follows the molecule 
with perfect fidelity. 

Next we assume that F* is a white noise process with 
power spectral density /. Following the same procedure 
as for yields 



(16) 



and puts us in position to compute cr^ as defined in Eq. 
11. We combine Eqs. 15 and 16 to get 
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which is independent of t because all 2Dt terms stemming 
from the center-of-mass motion have been canceled. This 
is the mathematical statement of the fact that the track- 
ing system cancels the molecule's center-of-mass motion 
on time-scales longer than the tracking and intramolecu- 
lar relaxation times. 

We cannot compute the component of the stage motion 
due to z*^ until we specify the model that we will use for 
the intramolecular dynamics. Next we present a simple 
example model that that captures the essential features 
of intramolecular tracking-FCS. 



1. Example: independent harmonically hound dyes 

We consider the example of a collection of dyes bound 
by a harmonic potential to a central point that is under- 
going Brownian motion. Despite being somewhat artifi- 
cial, this example will contain all of the essential details 
of intramolecular tracking-FCS and is convenient in that 
it is exactly solvable. 

Wc let all dyes move independently of each other and 
with the same dynamics, and we require that motion 
along different Cartesian axes is uncorrelated. The dyes' 
intramolecular correlation function is a well-known result 
from the Ornstein-Uhlenbeck theory [31], 
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where /3 is the stiffness of the attractive bond to the cen- 
tral point and a? is the variance of the dye position. The 
distribution of these dyes is Gaussian, so the formula for 
52 (t) in Eq. 9 applies. 

For simplicity, we choose a first-order dynamical model 
for our tracking system, with A = —7, B = 7 and C = 1. 
We may now compute the intramolecular response matrix 
elements A^^. If the sets of FCS dyes and tracking dyes 
are distinct, we have 
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The first term in this expression is the intramolecular 
correlation function, Eq. 18. The second term arises be- 
cause tracking errors add fiuctuations to the fluorescence 
signal; it is strictly positive, indicating that in this con- 
figuration the fluorescence fluctuations are always larger 
than if the tracking system had an exact estimate of the 
center-of-mass position (as in the limit cxd). In the 

particular case where the same set of dyes are used both 
as tracking indicators and tFCS probes, the result is 



76 



/3e- 



(20) 



Here the tracking error component is strictly negative 
because the fluctuations in the dye position relative to 
the probe beam are suppressed by the tracking system. 

Figure 2 illustrates the tFCS curves for the harmonic 
model with varied values for (3, including the limit /3 — >■ 
00 representing a solid particle (this limit does not ac- 
count for rotational motion, so the particle it represents 
is densely labeled). For /3 ^ 7, the time-scales of the in- 
tramolecular motion and of the tracking system response 
are separated and the curves for the harmonic models 
approach that of the solid particle for t > (3^^ . In this 
case, the molecule's rapid internal fluctuations average 
away on time-scales relevant to the tracking system. The 
tracking system is able to follow the intramolecular mo- 
tion for smaller /3; consequently, the fluorescence fluctu- 
ations increase significantly when the tracking and probe 
dyes are distinct and decrease significantly when the dyes 
are identical. 

In the next section, we use the harmonic model to 
demonstrate features of the tFCS approach that general- 
ize qualitatively to other models of molecular motion. 



C. General features of intramolecular tracking-FCS 

Tracking-FCS shares many general properties with sta- 
tionary FCS. This section is not a survey of these familiar 
properties, but instead points out differences between the 
two approaches or features that are unique to tFCS. For a 
general survey of stationary FCS, see [4]. We also include 
a more detailed discussion of the differences between sta- 
tionary FCS, tFCS and related image correlation tech- 
niques in the supporting information. 
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FIG. 2: Example tracking-FCS curves for the harmonic 
model. Dashed curves indicate distinct tracking and probe 
dyes (Eq. 19), and solid curves indicate identical tracking 
and probe dyes (Eq. 20). The legend indicates the value of 
/3 for each curve. All curves used N = M = 1, a = 200nm, 
a" — (250nm)^ for all axes, w^y = 280nm, and = SOOnm. 



1. Center of mass statistics and tracking errors 

The independent measurement of the position of the 
tracked molecule over time is a hallmark difference be- 
tween stationary PCS and tracking techniques, allowing 
us to make very accurate diffusion coefficient measure- 
ments while simultaneously greatly suppressing the fluo- 
rescence decay due to translational motion. This is cov- 
ered extensively in [25, 26] for when the tracked molecule 
is a point particle; here we examine the effect that in- 
tramolecular motion has on the center-of-mass statistics. 

The most basic characterization of the performance 
of a particle tracking system is the size of its center-of- 
mass tracking error, which we usually define for each axis 
by cr", the variance of the stationary distribution of the 
quantity y* — x*jjj. Using the harmonically bound dye 
model with first-order tracking dynamics as an example, 
we find 

v„[,'-4.,].5+/;2+_£;i_, ,21, 

where the first two terms are familiar from point particle 
tFCS and the third results from the response of the stage 
to the intramolecular motion. Thus the intramolecular 
term adds a steady-state error that, when N is small 
and ^ 7, can actually dominate the tracking system's 
response. In some cases this may be desirable: for in- 
stance, if we were tracking the diffusion of a single point 
on a molecule as it moves within the molecule itself. In 
such cases it may be more sensible to define the tracking 
error differently, because is no longer the relevant 
target position; the derivations in this paper would facil- 
itate doing so. 
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FIG. 3: D{t), as defined in the text, evaluated for the 
harmonic model over a range of values for j3. We used 
D = l^mVs, a = 500nm, 7 = lOHz, / = 8nm//HS, iV = 1. 

While the tracking error is important as a figure of 
merit, a more general and practically useful quantity is 
the variance of the stage increment, to which we fit curves 
in order to determine 7, / and D from our data. Figure 3 
illustrates the effect of the intramolecular motion on the 
quantity D{t) = Var [y*"*"^ — ?/*] (2t)~^, defined so that 
D{oo) = D. The sigmoid-like shape of the curve for small 
/3 is what is typically observed due to pure center-of-mass 
tracking[26]. The curve peaks as a function of both r and 
/3 due to the tracking system's response to the intramolec- 
ular motion. The peak along r indicates the trade-off 
between intramolecular motion on short time-scales and 
center-of-mass motion on long time-scales. The peak 
along /3 occurs because slow intramolecular fluctuations 
(small 13) are dominated by center-of-mass motion while 
fast intramolecular fluctuations (large /3) are not tracked 
well due to latency in the feedback loop. 



2. Concentration and molecular size 

An obvious practical difference between stationary 
PCS and tFCS is that the tracking approach is not di- 
rectly sensitive to the sample concentration. The concen- 
tration determines how frequently molecules drift into fo- 
cus, but once a single molecule is detected and tracked its 
fluctuations are independent of the number of molecules 
elsewhere in the sample. By contrast, the sample con- 
centration has a large infiucnce in stationary FCS, deter- 
mining the variance of the fiuorescence fiuctuations and 
consequently the overall scaling of the FCS curve through 
the value 52(0). 

The fluorescence variance is somewhat more compli- 
cated in tFCS, as it is partially determined by tracking er- 
rors (exclusively so in point-particle tFCS). Unlike in sta- 
tionary FCS, however, the fluorescence variance in tFCS 
also contains a component due to intramolecular fluctu- 
ations. We demonstrate this using the harmonic model 
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as an example, and for simplicity we assume that the 
tracking errors are small and unaffected by intramolecu- 
lar motion, i.e. ct° ^ and N ^ oo. In the limit that 
the molecule is much bigger than the beam waists, i.e. 
w/2, the variance of the FCS curve is 
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where we have defined the effective intramolecular dye 
concentration C = M/a^ and imaging volume V = 
2~'^/'^w1.yWz- Equation 22 has exactly the same form as 
in stationary FCS except with the sample concentration 
replaced by its tFCS analog, C. 

It is straightforward to show that the tFCS variance is 
dominated by tracking errors in the small-particle limit 
a w/2, where it approaches its value for point par- 
ticles. Hence intramolecular tFCS has the same general 
property as intramolecular stationary FCS, in which the 
FCS curve is dominated by intramolecular motion for 
molecules larger than the excitation beam and by trans- 
lational motion for smaller molecules[9]. An important 
difference, however, is that 52(0) is affected — strongly so 
for large particles — by molecular size in tFCS whereas it 
is determined by the sample concentration alone in sta- 
tionary FCS [11]. This fact makes tFCS more sensitive 
to molecule size than stationary FCS and improves the 
numerical conditioning of fits to 52 (t). 



3. Sensitivity to heterogeneity 

Stationary FCS is a true single-molecule method in 
the sense that it generates fluorescence signals from only 
one molecule at a time. However, the signal from a sin- 
gle molecule is never sufficient to determine a detailed 
FCS curve because only a small number of photons are 
typically detected from each molecule, so that statisti- 
cal counting errors are very large. This is quite unlike 
tFCS, in which long observation times with the tracked 
molecule located in the brightest part of the excitation 
beam can enhance the signal-to-noise ratio by a factor of 
100 or more[28]. Tracking-FCS can therefore be used to 
determine complete FCS curves on individual molecules 
over a wide range of time-scales, providing the unique 
ability to resolve difi^erences in the dynamics of difi'erent 
molecules. We will demonstrate this in our measurements 
in the next section. 



4- Sensitivity to decays on long time-scales 

Another characteristic feature of tFCS is sensitivity 

to fluorescence decays on time-scales much longer than 
the characteristic diffusion time of a molecule through 
the laser focus. 52 (t) only decays to zero when the 
quantities X^-,^^ and cr'^ arc both approximately zero; this 
means that decays in the intramolecular term are 



detectable even if they occur at times much longer than 
the ccnter-of-mass diffusion time. Stationary FCS, by 
contrast, contains a decay of the form g2(T) oc t^^ for 
T longer than the characteristic diffusion time [4], so that 
any longer decays are sharply attenuated. 



III. APPLICATION TO DOUBLE-STRANDED 
DNA 

We now apply the theory developed in the previous 
sections to measurements on double-stranded DNA. 



A. Linear polymer models 

The simplest dynamical model for the motion of a flex- 
ible polymer chain was developed by Rouse[7, 27], and 
forms the basis for subsequent refinements by Zimm[8] 
and others of the physics incorporated into the models. 
The Rouse model describes the polymer as a sequence 
of beads, connected and held together by springs and 
driven by independent Brownian forces. The discrete set 
of equations of motion for these beads is transformed into 
a partial differential equation for the polymer backbone 
by taking a continuum limit, assuming that the RMS 
distance between beads is much smaller than the overall 
length L of the molecule. The resulting equation defines 
a time-dependent space curve R(?i, t) parameterized by 
the position u G [0,L] along the backbone contour, and is 
solved using a Fourier series expansion. Prom this solu- 
tion we find the correlation function needed to compute 
52(t)[27], 



(R(u, t + t)R{u', t)'^) = 2Dt Ids 



2r, 



2 00 



1 



9=1 



r/r. 



COS 



'irqu' 




irqu' 


I L \ 


cos 




L 



Ids, (23) 



where is the mean squared distance between the end 
and center of mass of the polymer and is the relaxation 
time of Fourier mode q, which scales as q~^ for the Rouse 
model and approximately as q~^^^ for the Zimm model. 
We provide A^/i for a polymer with probe dyes at contour 
positions {um} and tracking dyes at {vn} in Eq. S2 in 
the supporting information. 

We have already demonstrated that the Rouse model 
is sufficient to describe tFCS measurements on A-phage 
DNA, and that in fact the Zimm model is not consis- 
tent with our data[14]. We will therefore only use the 
Rouse model in our fits in this paper. The results here 
are intended to illustrate the theoretical consistency and 
capabilities of the tFCS technique and the technical as- 
pects of measurements on labeled polymers. We focus 
more on the specific quantities that tFCS is sensitive to 
than on the scientific interpretation of those quantities 
although, together with our closely-related work in [14], 
the technical developments described in this paper have 
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provided a significant advance in the field of experimental 
DNA dynamics. 



B. Labeling 

The simplest way to incorporate fluorescence labels 
into DNA is with intercalating dyes that insert them- 
selves randomly at sites approximately uniformly dis- 
tributed over the DNA backbone. It is possible to in- 
corporate many of these dyes, which makes them valu- 
able for tracking since the molecule becomes brightly flu- 
orescent with little influence of intramolecular motion on 
the dynamics of the tracking system. Intercalating dyes 
are less useful for the probe dyes which, in contrast to 
the tracking dyes, should be incorporated quite sparsely 
in order to maximize sensitivity to intramolecular mo- 
tion. The difficulty is that the dye positions are random 
and are different for each molecule; both tFCS and sta- 
tionary FCS are sensitive to these differences, but not 
sufficiently sensitive to infer the dye label configurations 
precisely. This configuration uncertainty must be prop- 
erly addressed when analyzing tFCS and stationary FCS 
data, and inevitably reduces the sensitivity of these tech- 
niques to the underlying polymer dynamics. 

An alternative labeling procedure uses methods from 
molecular biology to incorporate dyes site-specifically 
into the DNA backbone. This is more challenging, but 
provides much greater experimental sensitivity because 
the dye configurations are exactly known and are identi- 
cal for all molecules. 

We prepared three different DNA-dye conjugates for 
our experiments: one molecule with intercalating track- 
ing and probe labels; one molecule with intercalating 
tracking labels and a single probe label on the molecule's 
terminus; and one molecule with individual tracking and 
probe labels on opposite termini. We provide theoretical 
formulae for the tFCS curves for each of these molecules, 
derived from Eq. 23, in the supporting information. 

C. Materials and methods 

We made all measurements using the three- 
dimensional tracking microscope described in [28], 
which we enhanced as described in [14] by the addition 
of a second excitation laser at 444nm and a confocal 
detection channel with peak sensitivity at 480nm. 

We purchased genomic A-phage DNA (48502 bp) from 
NEB and Invitrogen and produced three different DNA- 
dye conjugates. One conjugate was labeled with the in- 
tercalating dyes POPO-3 (for tracking) and POPO-1 (for 
tFCS), both purchased from Invitrogen. Molecules with 
mean inter-dye spacings of 300 base pairs (bp) for the 
tracking dye and between 300 and 48000 bp for the tFCS 
dye were prepared by adding DNA to dilute solutions 
of dye in TE buffer (lOmM TRIS, ImM NazEDTA, pH 
8.0) and incubating at room temperature for 20 minutes. 
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FIG. 4: Tracking-FCS data for genomic A-phage DNA with 
varied label configurations. Mean values of 52(7") are indicated 
by circles, and 2a bounds by the shaded regions. Fit curves 
to the Rouse model are superimposed in black. The legend 
indicates the fit parameters and goodness-of-fit p-values as 
described in the text. The POPO-3/POPO-1 conjugate was 
prepared with POPO-1 label density of 1 dye per 24000 bp 
(for an expected Ai ~ 2 dyes/molecule). 



At these relatively low dye densities, we anticipate little 
alteration of the DNA dynamics [9]. 

One conjugate was labeled by incorporation of a single 
Atto-425 conjugated dATP (Jena Biosciences) into a ter- 
minal single-stranded overhang using Klenow exo- DNA 
polymerase (NEB). The reaction was cleaned up using 
the QIAEX II silica adsorption procedure (Qiagen) and 
repeated ultrafiltration in a Microcon YM-100 unit (Mil- 
lipore) until no free dyes were detected in solution. This 
conjugate was then labeled with POPO-3 for tracking as 
described above. 

A final conjugate was prepared by ligating the biotiny- 
lated oligonucleotide sequence 5'-GGGCCGCGACCT- 
3'-Bio (IDT) onto the free single-stranded overhang of the 
Atto425-labeled conjugate using T4 DNA ligase (NEB). 
This molecule was purified as described above and then 
mixed with an excess of streptavidin-coated quantum 
dots (qd655, Invitrogen) and incubated at room tempera- 
ture. The qdot-DNA conjugate was purified by repeated 
ethanol precipitation until few free quantum dots were 
detected in solution. 

All molecules were imaged in TE buffer with 1% v/v 2- 
mercaptoethanol added to enhance the fiuorescence yield. 



D. Measurements 

Figure 4 shows (t) measured from the three DNA- 
dye conjugates along with fits to theoretical curves for 
the Rouse polymer model in which the polymer param- 
eters Ti and ro were fit for all three curves and the 
mean number M of intercalating dyes was also fit for 
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the POPO-l/POPO-3 curve. All fits used a first-order 
tracking system model with = 15Hz, 7^ = 2Hz, 
(jO = (7° = (lOOnm)^ and (7° = (250nm)2 as deter- 
mined from D{t) as described in Section II CI. All 
fits were corrected for attenuation due to background 
counts [14]. The fits for the POPO-3 labeled molecules in- 
clude high-frequency triangle-wave oscillations account- 
ing for our excitation scheme, described in [14], by which 
we alternatingly expose the molecules to the two exci- 
tation lasers to prevent cross-talk between tracking and 
probe fiuorescence. Furthermore, all fits span the range 
lO^'^s < r < lO^^s due to interference by a systematic 
decay in our data spanning 0.2s < r < Is, likely caused 
by an imperfection in the z-axis tracking[14]. Although 
this systematic decay is very small compared to the pri- 
mary tFCS decay, the high statistical resolution on these 
longer time-scales would place too much emphasis on this 
spurious component of the tFCS curves if it were included 
in the fits. 

We evaluated the goodness of the fits to the data by as- 
suming that the statistical noise on each of the measured 
curves is Gaussian, so that the sum of the squared resid- 
uals has a distribution. The size of the statistical un- 
certainty in the Atto425/POPO-3 and Atto425/Qdot655 
conjugates was determined by computing the variance in 
(72 (t) over the set of observed molecules. The uncertainty 
in the POPO-l/POPO-3 conjugate was determined us- 
ing the bootstrap method after processing the data to 
account for dye configuration uncertainty, as described 
later in this section. 

All three curves fit the data satisfactorily from a sta- 
tistical perspective, as indicated by the p-values given in 
the figure. The large p-values for the P0P0-3/P0P0- 
1 and Atto425/Qdot655 conjugates suggest that our es- 
timates for the noise in these curves are probably too 
large. The consequence of over-estimated uncertainty is 
reduced ability to discriminate between different models 
or between different parameter values. Fortunately, we 
showed in our model discrimination experiments that the 
difference between the Rouse and Zimm models was great 
enough to resolve using tFCS on the Atto425/POPO-3 
conjugate[14]. 

The POPO-l/POPO-3 and Atto425/POPO-3 conju- 
gates yielded fairly consistent values for tq and ri. As 
discussed in [14] these fit parameters, together with 
the measured translational diffusion statistics (we find 
D = 0.8/im^/s), are internally consistent with the pre- 
dictions from basic polymer theories. By contrast, both 
fit parameters for the Atto425/Qdot655 conjugate are 
consistent with those of a molecule that is smaller by a 
factor of 3. We attribute this difference to non-specific 
interactions between the DNA and the streptavidin on 
the qdot surface. Interestingly, such interactions would 
be very difficult to rule out using stationary FCS because 
of that technique's insensitivity to molecular size, so that 
the unfortunate problems with this conjugate highlight a 
strength of our technique. 

While data from the Atto425/POPO-3 and 




Mean dye spacing [bp] 



FIG. 5: Sensitivity to labeling heterogeneity. The measured 
variances of (72(0) within individual DNA samples (blue) are 
compared to the variances predicted by simulations (red) 
of ensembles of polymers with uniformly distributed dyes. 
Dashed lines indicate the shot-noise variances for 15s track- 
ing times with 2.5kHz count rates (upper), and 40s tracking 
times with 8kHz count rates (lower). Experimental tracking 
times averaged 25s and count rates averaged 3.8kHz. 95% 
error bars were determined by bootstrap sampling. 



Atto425/Qdot655 conjugates were both straightfor- 
ward to analyze, the sensitivity of tFCS to the precise 
positions of dyes in the POPO-l/POPO-3 conjugate 
demands a more sophisticated approach. For example, 
Eq. 23 predicts both a slower relaxation time and a 
larger relaxation amplitude for a molecule with a single 
dye on its end {u = 0) compared to a single dye in the 
middle of the chain [u = L/2). Briefly, we account 
for configuration uncertainty in the POPO-l/POPO-3 
conjugate by treating each measured 92 (t) as the sum of 
an ensemble average curve (7*(t), a contribution due to 
dye configuration 77 (r), and a statistical noise term ^(r). 
We estimate the value of 5*(t) from a set of measured 
92{t) by using a maximum- likelihood estimator; the data 
in Fig. 4 is the output of this estimator. Furthermore, 
since it is difficult to compute (7*(t) exactly for the 
purpose of fitting, we determine the fit curve in the 
figure by generating 1000 random dye configurations at 
a fixed dye density and averaging their tFCS curves. 
Complete details regarding the maximum-likelihood 
estimator are given in the supporting information. 

We performed experiments on POPO-l/POPO-3 con- 
jugates over a wide range of POPO-1 densities in order 
to fully demonstrate the ability of tFCS to resolve dif- 
ferences between molecules based on their dye configu- 
rations. During these experiments we varied the probe 
excitation intensity in order to keep the average count 
rate, and therefore the measurement noise, roughly con- 
stant. The variations between molecules can be evalu- 
ated most simply by computing the variance of the mea- 
sured tFCS curves: expressing the measured curves as 
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a sum of .9*(t), ?7(r) and ^(t) as described above, we 
have Var[g2(T)] = Var[77(r)] + Var[^(T)]. Since the mea- 
surement noise is constant, any differences in Var[(72(''')] 
observed in these experiments must derive from the de- 
pendence of ?7(t) on dye label density. Figure 5 shows 
the results of these measurements. As expected, at suffi- 
ciently low dye densities (large dye spacings) the observed 
variations between molecules exceed the values predicted 
for measurement noise alone and therefore suggest that 
we are indeed observing heterogeneity within our sample 
due to differences in dye configuration. 

In order to evaluate the observed heterogeneity quanti- 
tatively, we determined the expected variance as a func- 
tion of dye spacing by using Monte Carlo simulations. 
These simulations suggest that our observed variances 
are about a factor of 3 smaller than expected. There are 
several possible sources of this discrepancy. One; is that 
our prepared dye densities may differ from the true densi- 
ties, as would be the case for example if some of our DNA 
were adsorbed to the walls of our sample tubes. However, 
given that the fitted M in Fig. 4 roughly matches the 
prepared density and that the variances in Fig. 5 appar- 
ently saturate at similar dye densities, we suspect this is 
not a major source of error. Another possibility is that 
our data suffers from selection bias, because we choose 
to quantitatively examine only fluorescence trajectories 
for which we clearly see both a POPO-1 and a POPO- 
3 signal. This selection process is necessary to ensure 
that dim molecules, which have very poor signal-to-noise 
and signal-to-background ratios, do not contribute too 
much noise to our measurements. However, as a conse- 
quence of selecting data this way, we discard data from 
molecules that are either labeled with very few dyes or 
arc labeled more toward their ends than in their centers 
(because labels near the ends tend to remain further from 
the molecule's center of mass and, therefore, the probe 



beam focus). Both exclusions will tend to lower the ob- 
served variances, although it is difficult to be certain of 
the extent to which they have done so in our measure- 
ments. 



IV. CONCLUSION 

We derived the FCS statistics measured on large 
molecules tracked via feedback control. We demonstrated 
several distinguishing features of these statistics by using 
a pair of simple molecular models, and demonstrated the 
application of the tracking-FCS technique to the study 
of large double-stranded DNA molecules. These results 
demonstrated our ability to recover realistic and con- 
sistent parameters for the dynamics of DNA labeled in 
two different ways, as well as to identify alterations in 
the dynamics of the molecule due to attachment of a 
streptavidin-coated quantum dot label. Furthermore, we 
demonstrated the ability to resolve configuration differ- 
ences between different molecules in a sample subjected 
to a random labeling scheme. We have demonstrated 
that tFCS is a method with \miquc capabiliitcs; we be- 
lieve that these capabilities will establish it as a valuable 
tool for a range of difficult problems in molecular bio- 
physics. 
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